
SCALABLE MONITORING METHOD AND DEVICE 

^» ThKfirp^ent invention relates to a method for the scalable monitoring of a running 
computer system. The^cmTiptttaiLSystemis constituted by a set of computer equipment units 
5 called distributed resources, each being interconHe5ted4ajhe others through a communication 

Large companies have an increasing number of units of computer equipment to 
manage. These units, connected to one another through a communication network called a 
local area network (LAN), are managed by a manager. In order to manage (supervise, act on, 

]0 monitor, control) these units remotely from one point, an architectural model comprising a 
manager and an agent, for example of the SNMP type, is most commonly adopted. In this 
architectural model, the (SNMP) agents implemented in the equipment units of the network 

%Q inform the manager of the status of each of the units managed. In the vast majority of cases, 

s /j this manager manages several thousand units of distributed equipment in one or several 

fS countries. 

p In order to characterize the behavior of a set of resources interconnected through a 

] r ' network, any monitoring method synthesizes, and therefore calculates values known as 
O indicators, which make it possible to characterize the operation and the status of these 
iH resources. The calculation of an indicator is performed by an agent called an "indicator 
2g agent," which periodically calculates a value resulting from a function that combines values 
!= measured by means of an SNMP or CMIP protocol layer with values calculated by other 
indicator agents. 

itionally, this monitoring is performed by the manager, which centralizes the 
acquisition of tn&HQeasurements and the calculation of the indicators. The information 
25 exchanged between thel«^nager and the resources flows through a wide area network, also 
called a "WAN." However, tn&s^st of bandwidth in the WAN is not adapted to monitoring. 
Likewise, the exchanges of informati&HsHivolved in centralized monitoring generate heavy 
usage of the WAN. This problem is explainetkbv the fact that the bandwidth of the WAN is 
too small for the ever-increasing amount of informlttkm that must pass between the managers 
30 and their equipment. Therefore, the object of the present in^sqtion is to eliminate the 

drawbacks of the prior art by offering a scalable monitoring methcJ&^hat makes it easy to 
reorganize the architecture of the computer system when it changes. 
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This object is achieved through the fact that the method for the scalable monitoring of 
a computer system comprising a plurality of computer equipment units constituting hardware 
resources to be monitored forming a monitored domain, the method being implemented by 
means of a central computer system called a manager connected to a communication network 
5 that allows the transfer of information between at least one resource and the manager, is 
characterized in that it comprises: 

a ste p loi bre aking U1 6 lliuiiilur e ^d oiBaindown into monitored subdomains 
comprising a predetermined maximum number of resourcesT^^--^.^^ 

- a step for automatically creating and configuring, for each subdomain, an 

10 information synthesis node comprising at least one synthesis agent stored in the storage 
means of a resource; each synthesis agent is designed to synthesize indicator values 
calculated and stored in the storage means of at least one resource, these indicators 
=0 representing an operational status of the resources of the subdomain and being evaluated by 
= J\ indicator agents installed in these resources, each indicator agent being uniquely identified by 
^f5 the name of the indicator it calculates and by the subdomain in which it is installed and being 
13 associated with each synthesis agent using the corresponding indicator value, 
" p - a step for modifying the associations between the synthesis agents and the indicator 

O agents when the predetermined maximum number of resources in a subdomain is reached, in 
O order to accommodate the addition or deletion of indicators so that the new architecture of the 
25 monitored domain comprises, in each subdomain, a number of resources lower than the 
j« predetermined maximum number of resources. 

According to another characteristic, the step for configuring a synthesis node 
comprises, for each synthesis agent: 

- a step for searching, in a table stored in the storage means of a resource, for the 

25 name of the indicator agent or agents required to calculate the indicator of the synthesis agent 

- a step for the subscribing the synthesis agent to the indicator agents found during the 
search step, this subscription step allowing each synthesis agent to automatically receive, in 
its subscription table stored in the storage means of a resource, the new values of the 
indicators found. 

30 According to another characteristic, the search step comprises: 

- a step for the sending by the synthesis agent of a notification to a naming service 
dedicated to storing the associations between a subdomain name, an indicator agent and an 
indicator, this notification comprising the name of a given subdomain and a given indicator. 



According to another characteristic, the modification step comprises: 

- a step for installing at least one indicator agent in each new resource added to a 
subdomain, 

- a step for sending, to the synthesis agents requiring the value of the indicator of the 
new indicator agent or agents, a notification comprising the identification of the new 
indicator agent or agents, 

- a step for subscribing each synthesis agent to the new indicator agents required to 
calculate the indicator of the synthesis agent. 

According to another characteristic, the modification step comprises: 

- a step for selecting, for each subdomain, the resources to be deleted 

- a step for sending, to the synthesis agents using the value of the indicator of the 
indicator agent or agents installed in the selected resource or resources, a notification 
comprising the identification of the deleted indicator agent or agents 

- a step for unsubscribing the synthesis agents from the indicator agents whose 
indications are contained in the notification. 

According to another characteristic, the maximum number of resources per 
subdomain is determined either so that the cost of calculating the indicators is as low as 
possible, or so that the number of synthesis nodes is as low as possible. 

A second object of the invention consists of offering a scalable monitoring device that 
makes it easy to reorganize the architecture of the monitoring system when the size of the 
monitored domain, i.e. the number of elements to be monitored, changes. 

This object is achieved through the fact that the device for the scalable monitoring of 
a computer system comprising a plurality of computer equipment units constituting hardware 
resources to be monitored forming a monitored domain is characterized in that it comprises 
mans for breaking the monitored domain down into monitored subdomains comprising a 
predetermined maximum number of resources, means for creating and configuring, in the 
storage means of a resource, information synthesis nodes comprising at least one synthesis 
agent stored in the storage means of at least one resource and designed to synthesize indicator 
values calculated and stored in the storage means of a resource, these indicators representing 
an operational status of the resources of the subdomain and being evaluated by indicator 
agents installed in these resources, each indicator agent being uniquely identified by the name 
of the indicator it calculates and by the subdomain in which it is installed, the configuration 
of a synthesis agent comprising the storage, in the storage means of a resource, of the 


associations between the synthesis agent and indicator agents, means for modifying the 
associations between the synthesis agents and the indicator agents when the predetermined 
maximum number of resources in a subdomain is reached, so that the new architecture of the 
monitored domain comprises, in each subdomain, a number of resources lower than the 
predetermined maximum number of resources. 

According to another characteristic, the means for configuring a synthesis node 
comprise means for searching, in a table stored in the storage means of a resource, for the 
name of the indicator agent or agents required to calculate the indicator of the synthesis 
agent, and means for subscribing the synthesis agent to the indicator agents found during the 
search step, these subscription means allowing each synthesis agent to automatically receive, 
in its subscription table stored in the storage means, the new values of the indicators found. 

According to another characteristic, the search means comprise means for the sending 
by the synthesis agent of a notification to a naming service dedicated to storing, in a table 
stored in the storage means of a resource, the associations between subdomain name, an 
indicator agent and an indicator, this notification comprising the name of a given subdomain 
and a given indicator, and means for the sending by the naming service of a notification to the 
requesting synthesis agent, comprising the name of the indicator agent or agents 
corresponding to the association of the given subdomain and the given indicator. 

According to another characteristic, the modification means comprise means for 
creating and storing at least one indicator agent in each new resource added to a subdomain, 
means for sending, to the synthesis agents requiring the value of the indicator of the new 
indicator agent or agents, a notification comprising the identification of the new indicator 
agents or agents, means for subscribing each synthesis agent to the new indicator agents 
required to calculate the indicator of the synthesis agent. 

According to another characteristic, the modification means comprise means for 
selecting, for each subdomain of the resources to be deleted, means for sending, to the 
synthesis agents using the value of the indicator of the indicator agent or agents installed in 
the selected resource or resources, a notification comprising the identification of the deleted 
indicator agent or agents, and means for unsubscribing the synthesis agents from the indicator 
agents whose identifications are contained in the notification. 

According to another characteristic, the maximum number of resources per 
subdomain is determined either so that the cost of calculating the indicators is as low as 
possible, or so that the number of synthesis nodes is as low as possible. 



Other characteristics and advantages of the present invention will emerge more 
clearly through the reading of the following description, given in reference to the attached 
drawings, in which: 

- Fig. 1 represents a simplified schema of a computer system in which the method 
5 according to the invention can be applied; 

^^^> - Fig. ZTgpresejiisjui^e ^ of the links between monitoring agents 

according to the method of the invention 

- Fig. 3 icp i usents . the - p r ucess fui depluying a monitoring method. * 
XTSxplaijied above, a computer system (1) comprises at least one local area network 
(10, 20) that communicate?^hb*^central system (2) or manager through a wide area network 
(3). Each local area network (10, 20) compitses^least one unit of computer equipment (101, 
_ 102, 201, 202) called a resource. 
*P In the prior art, the monitoring of all the resources (101, 102, 201, 202) is handled by 

\-j means of remote agents (SNMP or CMIP agents, depending on the management protocol in 
yi question) installed in each resource (101, 102, 201, 202). These agents allow a manager to 
O collect information on the resources (101, 102, 201, 202) by measuring given parameters, for 

example by sending an SNMP request GetRequest to the SNMP agent in question, then to 
^ transmit the collected information to the manager (2) by means of a request, for example 
□ GetResponse in the SNMP world. The manager (2) then evaluates the indicators from this 
jSjj received information. It is understood that all the requests sent by the manager (2) in order to 
gather monitoring information and the responses to these requests from the agents produces 
considerable congestion in the wide area network (3), consequently diminishing its 
performance. 

According to the invention, all or part of the evaluation of the indicators is distributed 

25 at the local area network level (10, 20) to indicator agents. 

To do this, for each indicator (I n ) characterizing a particular operation or status of a 
resource (101, 102, 201, 202), a specific indicator agent (Al, A2, Bl, B2) is installed in the 
resource (101, 102, 201, 202). In the exemplary embodiment represented in Fig. 1, each agent 
(Al, A2, Bl, B2) is responsible for evaluating a different indicator (I n ). In other words, each 

30 indicator (I n ) makes it possible to determine a different operation or a different status in each 
resource (101, 102, 201, 202). However, each agent (Al, A2, Bl, B2) described in reference 
to Fig. 1 can be present, for example, in all of the resources (101, 102, 201, 202) or all the 
resources of the same type in the computer system (1). 
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For example, a first agent (Al) can evaluate an indicator (I A i) whose formula makes it 
possible to know, for example, the number of alarms per minute emitted by a first resource 
(101). It is understood that this agent (Al) can be installed in all the resources (101, 102, 201, 
202) of a computer system (1). 
5 The indicator agents (Al, A2, Bl, B2) are elementary programming and execution 

entities. The indicator agents (Al, A2, Bl, B2) are autonomous in order to give the structure 
the property of scalability by making it easy to add or delete one or several indicator agents in 
the architecture. They are made autonomous by correctly defining the interface for these 
agents. They are also autonomous in terms of communication. They communicate using 
10 notifications and not procedure calls, which makes it possible to open up the architecture. 
This communication is asynchronous, thereby making it possible to obtain a better 
parallelization of the indicator agents and hence a better scalability of the architecture. 
*P The indicator agents (Al, A2, Bl, B2) can be persistent objects, and their status is 

I j derived from data that, in this case, can exist outside any execution structure and can be 
If 5 stored in persistent E 2 PROM or hard disk memories. An indicator agent (Al, A2, Bl, B2) is 
|3 an independent machine that reacts to notifications. 

J~ Each agent (Al, A2, Bl, B2) is an object of a class and is identified by a respective 

\J identifier id( A i), id( A 2)> id(Bi), id(e2) that is unique in the computer system (1) so that 
p communications between indicator agents are possible, and that makes it possible to locate 
26 them. An agent can be created in a remote server. The creation entity is responsible for 
i« creating the identifier. The structure of the identifier must take into account the static location 
of the agents in order to allow the system to forward the notifications, and to allow local 
"production" of the identifiers for the remote agents. The identifier of an agent comprises the 
following three parts: 

25 - the identification (idsac) of the agent server hosting the creation agent, 

- the identification (idsal) of the agent server hosting the agent created, and 

- a postmark (el), local to the agent server hosting the creation agent. 

A "factory" agent is present in all of the agent servers and is responsible for creating 
objects in the server It makes it possible to manage the creation of remote agents. 
30 According to the invention, each indicator agent (Al, A2, Bl, B2) is responsible for 

evaluating one indicator (I n ). In order to do this, it must comprise properties that make it 
possible to collect a measurement (M n ) on the resource (101, 102, 201, 202) with which it is 
associated, and communication properties in order to request or receive, as necessary, at least 


one other indicator evaluated by another indicator agent (Al, A2, Bl, B2) according to the 
invention, for example by means of notifications. These properties are obtained by means of 
specific program modules constituting each indicator agent (Al, A2, Bl, B2). Thus, an 
indicator agent (Al, A2, Bl, B2) comprises a collection module and/or a communication 
5 module. Likewise, an indicator agent (Al, A2, Bl, B2) also comprises an evaluation module 
for calculating the indicator itself from the formula, using the information collected by the 
other two modules. In this evaluation module, the equation that defines the value of an 
indicator (I A! for example) can refer to a possibly empty set of other indicators (l A2 , Ibi, for 
example). 

10 The indicator agents according to the invention use a communication protocol that is 

preferably asynchronous, which means that when an indicator agent sends a request for 
iBi collecting a measurement on a resource of the system or a notification to another indicator 

? ess? 

-0 agent, it does not wait for the response to this request or notification before possibly sending 
jsj a new request or notification. In other words, the sending of requests and notifications is 

performed in parallel. Likewise, the management protocol can be asynchronous. For 
15 example, the asynchronous management protocol used is the protocol SNMP or the protocol 

CMIP, and it makes it possible to parallelize the acquisition of the data and the calculation of 
*~ the indicators for which the data is available. 

O However, it is possible to use a synchronous protocol, but the performance in that 

M case is inferior, particularly in terms of speed in evaluating the indicators. Moreover, it is 
l 5 * appropriate to install an asynchronous device for collecting and storing measurements so that 
the indicator agent that sent the request need only consult the memory of this measurement 
collecting device. This device is, for example, a particular management information base 
(MIB), called an Rmon MEB by one skilled in the art. 

indicator agent manages a so-called subscriber list on which the names of other 
indicator agents aSfco^ding to the invention may be written. This list is stored in the storage 
means of the computer equtpiQent unit associated with the indicator agent, for example in the 
form of a table (1010, 2010). An in&rsotor agent Ai is written on this list by sending a 
specific so-called subscription notification ^Jtrbs^ribe (id(Ai) Management Information 
30 (Aj))" to another indicator agent Bj, which calculatestlfe-iadicator (Ibi). This notification 
includes as parameters a piece of so-called management infoirnation^th^tallows the sending 
agent to create an association between a propagation of a value modificationSndLihe other 
indicator agent (B|), and the identifier id( A i). Upon receiving a subscription notificatiofr>4he 
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destination agent (Bl) processes the notification by writing into the subscriber table (2010) 
the Kt^ntifier (Id A i) of the sending agent as well as the management information 
(Manageitaent Information (AO) on the list. This list is consulted by the indicator agent (Bi) 
that manages rafter the evaluation of the indicator (I B i) by the agent (Bi). If the new value 
5 of the indicator is (Afferent from the value previously evaluated and stored by the agent, then 
the agent sends each agbnt written on the subscriber list (2010) and identified by the 
parameter (id( A i)) a value change notification (ValueChanged) comprising the new value 
(Val(I B i)) of the indicator (I B i). T^^o this, after the evaluation of the indicator (Ibi), a 
comparison module of the indicator agfeqt (Bi) compares the new value of the indicator 
10 (Val(I B i)) to the value previously calculateaWalpCfeO) and stored. If the two values are 

different, the comparison module activates a procfe^ure for sending the value change 
;a _ notification (ValueChanged) to all of the agents, for example A, written on the subscriber list 
*0 (2010) then records the new value of the indicator in the st^ge means of its computer 
is] equipment unit. The value change notification comprises as parameters the new value of the 
Jf5 indicator (Val(I B 0) and the management information (Management ^formation (AO) of the 
□ target agent, so that the target agent can assign the value received to the imitator awaited. 

According to the invention, the monitored domain constituted by the se^pf resources 
*3 (101, 102, 201, 202) is organized into a plurality of subdomains (di={ Ai, A 2 }; d 2 ={Bi, B 2 }). 
(3 Each of the subdomains contains either a set of resources to be monitored, and/or a set of 
Ji& subdomains. 

H A subdomain is also characterized by the indicator agents running in the resources 

constituting the subdomain. For each subdomain, a set of agents called "synthesis agents" is 
installed in a particular resource so as to constitute a synthesis node. This resource can be a 
dedicated monitoring resource or a general-purpose resource chosen because of its technical 

25 characteristics to support the synthesis node. Moreover, this resource could also be a given 
resource of the subdomain or of another subdomain. In fact, one resource can support the 
monitoring of several subdomains, for example ranging from 0 to n, if this resource is 
capable of handling the load produced by the calculations performed by a plurality of sets of 
synthesis agents. 

30 Basically, the structure of a synthesis agent is identical to that of an indicator agent; 

the difference in terms of behavior and utilization lies mainly in the formula representing the 
indicator to be evaluated. 
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During the configuration of the monitoring of the monitored domain according to the 
invention, a naming service is used to define and maintain the associations between the name 
of the subdomain and the identification of the indicator agent responsible for evaluating an 
indicator of the subdomain. 
5 In other words, the naming service (SN) supplies, for each subdomain, all of the 

indicator agents present in this subdomain and associates each indicator agent (AO with the 
indicator (I a i) it calculates and with the subdomain (dl) to which it belongs, for example in a 
table (4) or in an association file. This naming service SN is then used during the 
configuration of each synthesis agent. This naming service can be either centralized or 

10 distributed. When the naming service is centralized, as represented in Fig. 1, the information 
relative to the association between a subdomain and an indicator agent is centralized in a 
single place in the monitored domain represented, for example, by a resource of the 

:Q monitored domain. 

5 71 When the naming service is distributed, the naming service associated with each agent 

M3 machine has the names of the agents it manages, as well as the subdomain to which each 
q agent belongs. 

The agent machine manages a certain number of properties, i.e. the atomic aspect of 
□ an agent's reaction to a notification and the delivery of the notifications in a causal sending 
==^ order. 

gf The agent machine handles the distribution of the notifications, the transmission of 

{=£ the notifications and the management of the overall atomicity. The agent machine comprises 

various entities such as an execution engine and a communication channel, this channel 

having two queues, a local queue and an external queue. 

The agent machine is distributed. The agent machines communicate with each other; 
25 the agents do not see this distribution since they communicate through notifications and 

address their local agent machine. An agent's reaction is always local. 

The centralized or distributed naming service SN is necessary for the configuration of 

the synthesis agents (A s ). In essence, as a general rule, a synthesis agent is designed to 

evaluate an indicator whose formula includes at least one set-oriented operator such as a sum, 
30 an average, or the determination of a minimum or a maximum in one or more indicators of 

the subdomain with which the synthesis agent is associated. However, a synthesis agent can 

also evaluate an indicator representing the general operation of the subdomain with which it 

is associated. 
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Thus, the configuration of a synthesis agent requires the latter to subscribe to all the 
indicator agents of the subdomain monitored by the synthesis agent that evaluate the indicator 
or indicators contained in the formula of the indicator of the synthesis agent. To do this, the 
synthesis agent (A s ) sends the naming service (SN), for each indicator required for the 
5 evaluation of its own indicator, a notification requesting an indicator lookup 

(indicatorLookupReq) that specifies the name of the subdomain (dl) that the synthesis agent 
(A s ) monitors, as well as the names of the indicators required for the evaluation of its own 
indicator. In response to this notification, the naming service performs a search in order to 
find the names of all the agents that perform the evaluation of the requested indicator in the 

10 subdomain specified in the notification. The naming service constructs and then sends a 

response notification (indicatorLookupResp) to the requesting synthesis agent containing the 

^ name of the domain (dl) and the name of the indicator agent or agents (Al, A2), if they exist. 

-;y This response notification is processed by the synthesis agent, which extracts the names of 

I /j the indicator agents and activates, for each indicator agent extracted, the subscription 

W§ procedure described above. 

13 In a variant of embodiment, when no indicator agent corresponding to the subdomain 

] r * name/indicator name association exists, the naming service can include means for storing the 
Q name of the synthesis agent that sent the notification and the name of the indicator requested. 
13 Thus, as soon as a new indicator agent is installed in the subdomain, it is detected by the 
2g detection means of the naming service (SN) or is declared to the latter by the installer. The 
naming service includes means for performing an update of the subdomain name/indicator 
name associations and for verifying whether the name of the indicator corresponds to a stored 
indicator name. If so, means in the naming service construct, then send, the response 
notification (indicatorLookupResp) to the agents whose names have been previously stored 
25 that have requested the name of the agent responsible for the new indicator detected. 

In another variant, the naming service uses an agent creation tool, such as an agent 
machine, to construct the agent requested. In other words, the naming service constructs an 
indicator agent that evaluates the requested indicator in the specified subdomain. 

According to the invention, the number of resources per subdomain is lower than a 
30 predetermined maximum number. This number is determined as a function of the monitoring 
policy chosen. For example, the maximum number of resources per machine is determined so 
that the cost of calculating the indicators of the synthesis agents is as low as possible, in order 
to reduce the calculation load in the resource or resources supporting the synthesis agents. 
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Another possibility consists of determining the maximum number of resources per domain so 
that the number of synthesis nodes is as low as possible, in order to reduce the number of 
resources responsible for monitoring and to concentrate the information representing the 
monitoring. 

5 Before further describing the mechanisms that make it possible to reorganize the 

architecture of the monitoring according to the invention, we will describe, in reference to 
Fig. 3, the method that makes it possible to deploy a monitoring configuration. 

The deployment of a monitoring configuration consists of performing the 
instantiation, i.e. the creation, of the indicator agents for the indicators defined by the list 
10 { (dl .11), (di.Ij), (dn.In)}, in which the indicator Ij must be evaluated in the subdomain 
di. To do this, the deployment method uses an agent called a configuration deployment agent 
_ (ADC). This configuration deployment agent handles the creation of agents called 
=0 configuration agents (AC). Thus, for each resource of a monitored subdomain, the 
j J configuration deployment agent (ADC) creates a configuration agent (ACa, ACb), which 

handles the process of creating the indicator agents (Aj, Bj, Q) specified by the configuration 

□ for the resource that has been assigned to it. Thus, the deployment of the various 

J~ configurations is parallelized in each of the subdomains to be monitored. In essence, the 
^ configuration deployment agents are created for each subdomain to be monitored in parallel 

□ fashion. 

The configuration deployment agents (ADC) and the configuration agents (AC) have 
H the same characteristics as the indicator agents, i.e. they are also managed by at least one 
agent machine of at least one resource of the domain to be monitored. The location of the 
configuration agents is of little importance; the various configuration agents can reside in the 
same resource or can be deployed in different resources, including even monitored resources. 
25 As explained above, a configuration agent (AC) handles the creation of the indicator 

agents specified by the configuration for a given resource M belonging to at least one of the 
subdomains (di) for which this indicator I n must be created. To this end, for any indicator I n 
of the configuration that can be instantiated in the resource M, the configuration agent creates 
an agent called an indicator deployment agent (ADI a , ADI b , Fig. 2) responsible for the 
30 deployment of the indicator or indicators I in the resource M. 

The indicator deployment agent (ADI) can be created in the same resource as the 
configuration agent (AC) or in a different resource. 
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In a variant of embodiment in which the programming language used is the "Java" 
language, the configuration agents (AC) will induce the dynamic loading of the classes 
"I_Deployer" and "I_Indicator", using the mechanisms defined by Java runtime. When these 
classes are not present in the resource in which the configuration process is executed, the 
5 configuration agent receives an exception of the "class not found" type 

"ClassNotFoundException", which activates, at the configuration agent level, means for 
downloading the software elements it requires in the resource, thereby incrementally 
deploying the software elements required for the monitoring, from a minimal kernel. Thus, 
the monitoring method according to the invention provides a solution to the dual problem of 

10 configuring a distributed monitoring, i.e., deploying the software configuration and 
configuring the monitoring. 

isa An indicator deployment agent (ADI) is an agent that determines, for a given type of 

^0 indicator, the various combinations of the values of the variables for which the indicator will 

i J be instantiated. It therefore handles both the name resolution process (described below) and 

¥5 the creation of the indicator agents, as well as their declaration to the naming service (SN). 

p The names of the objects referenced by the indicator agent that calculates the 

indicators, as well as the identifications of the agents that calculate the indicators referenced 

^ during the calculation, are part of the parameters for the creation of any indicator agent 

13 during its instantiation by the indicator deployment agent. 

For any indicator I n , an indicator compiler, after analyzing the equation that defines 

1=5= the indicator, generates two object classes "I_Deployer" and "I_Indicator", which respectively 
correspond to the indicator deployment agents that deploy the instances of the class 
"I ^Indicator" responsible for evaluating the indicator and to the indicator agents that evaluate 
the indicator. The class "I_Deployer" makes it possible to know which indicator agents 

25 identified by the class "I_Indicator" must be created, and makes it possible to declare to the 
naming service (SN) the indicator agents actually created. 

An indicator deployment agent has explicit knowledge of the indicators and the 
identifiers of the objects referenced by the equation. Each of these object identifiers {Idi, 
Id m } defines a structure wherein certain elements can be variables. 

30 When the management protocol chosen is the protocol SNMP, the indicator 

deployment agent executes the process (described below) for resolving the names of the 
objects referenced in the equation or the formula of the indicator and creates the 
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corresponding indicator agents by determining the valid combinations of the values of the 
variables. 

In essence, the equation that defines the calculation of the value of an indicator refers 
to objects identified by {Idi, Id m } using a possibly empty set of variables {Vi, V n }. 
5 Each object identifier Idj is associated with a set of variables { W 1? ... W k } belonging to the 
set { V[, . . ., V n }. Determining the first valid combination of the values of the variables Vi 
consists of applying a process (described below) for searching through the identifiers Idj, for 
example in the order 1 through m, in order to progressively instantiate all the variables { Vi, 
V n } and hence to calculate the identifiers { Idj , . .., Id m }. 
10 When all of the objects identified by {Id], Id m } in which the variables have been 

replaced with their corresponding values belonging to the combination of values { Vj, . . V n } 
^ exist, the search process of the indicator deployment agent (ADI) verifies whether or not the 
kQ constraint expressed in the values of the variables is satisfied. The indicator deployment agent 
i J (ADI) instantiates the indicator agent only when this last constraint is met. The objects 
jf5 actually referenced by the equation of the indicator are those identified by {Idj, Id m } in 
□ which the variables have been replaced by their corresponding values belonging to the 
7" combination of values { Vj, V n }. These objects are passed to the indicator agent as a 
^ parameters during the creation of the indicator agent. 

p In order to find the subsequent valid combinations of the variables Vi, the search 

26 process of the indicator deployment agent (ADI) searches for the index k for which there 
H exists a subsequent element for Id k , for k varying between m and 1 . If such a value of k does 

not exist, then the search process is terminated. 

If such a value of k does exist, then the search process is applied to the identifiers Idj, 

in the order k+1 through m for example, in order to progressively instantiate all the variables 
25 { V i , . . . , V n } and hence to calculate the identifiers { Id i , . . . , Id m } . 

The search process therefore makes it possible to calculate all the objects {Idi, ..Id n } 

of the equation that represent an indicator. 

The search process comprises a process for binding the variables, defined as follows, 

which makes it possible to determine the various variables of the identifiers Idi. Let Id be an 
30 object identifier in which the variables { Wj, ...W k } belong to the set { V|, V n } mentioned 

above. The identifier Id has the form a1.W1.a2.W2 a k .W k . ak +i, in which the coefficients aj 

have the structure of an object name in the ASN.l sense of the term, which means that the 

coefficients ai are constituted by a sequence of positive integers. 


Let j be the index for which the variables with successive indices {Wi, Wj} are 
already bound. Finding all of the objects that can be designated by this identifier consists of 
navigating through the instantiated MIB, searching for the objects, for example by launching 
a request of the SNMP "GetNextRequest" type, using as a parameter an object identifier 

5 whose identifiers begin with the root R = a1.W1.a2.W2 aj.Wj and by applying an object 

identifier unifiability criterion, defined below. Let Id* be the response to the SNMP request 
GetNextRequest applied to the root R, if such an object exists. The object does not exist if the 
response to the request GetNextRequest indicates that the designated object does not exist, in 
which case the search is terminated. 

10 If Id and Id' are unifiable according to the criterion defined below, then the variables 

{Wj, . .., Wk} are all bound, which means that they all have a value, and we have just found 
the identifier of an object that can be designated by Id: Id'. 

■*Q In order to find the next valid combination of the variables { W I? .. ., Wk}, knowing 

I J Id', it is necessary to reiterate the preceding process, using Id' as the starting point for the 

K search, after having unbound, i.e., retrieved the previous values of, the variables Wj that were 

O not bound at the very beginning of the preceding search process. 

If Id and Id' are not unifiable and the identifier Id' begins with R, then the search 

^ continues as before, using Id 1 as the starting point and unbinding any variable Wi that was not 

0 bound at the very beginning of the preceding search process. 

"35 If Id' does not begin with R, then there is no longer any existing object whose 

§^ identifier could be unified with Id. The search process is then terminated. 

The unifiability criterion is defined as follows. Let Id' be a reference identifier 
(without a variable part) wherein all the elements are integer values; two identifiers Id and Id' 
are unifiable if both identifiers are the same size and if the identifiers Id.ai = Id'.ai for any i 
25 belonging to [1, k+1]. Let {wi, w^} be the integer values of the identifier Id' that 

correspond to the variables {W h . .., W k } of Id, i.e. that are located at the same position in the 
corresponding identifier. The binding of the variables Wi takes place sequentially for indices 
varying between 1 and k through the following method: 

if Wj is bound, i.e., if Wj already has a value, then this value must be equal to w^ if 
30 the latter condition is not satisfied, then Id and Id* are not unifiable and the binding of the 
variables {Wi, ...W k } is not possible; 

if Wj is free, i.e., if Wj does not yet have a value, then the variable Wj will be bound to 
the value Wj. 
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For example, let Id = 1.2.3.W,.4.5.W 2 .6.7 and Id 1 = 1.2.3.10.4.5.20.6.7. These two 
object identifiers are unifiable for variables of Wi and W 2 whose values are 10 and 20. On the 
other hand, if Id = 1.2.3.W, .4.5.Wj.6.7 and Id' = 1.2.3.10.4.5.20.6.7, then Id and Id* are not 
unifiable because the variable W| cannot have the values 10 and 20 at the same time. 
5 The identifier unification process makes it possible both to verify that two identifiers 

are unifiable and to determine for which values of the variables that are still free prior to the 
start of the unification process this unification is possible. 

When the name resolution process has exhausted all the valid combinations, the 
indicator deployment agent (ADI) becomes unnecessary if the monitored configuration is 
10 never changed. 

When the software and/or hardware configuration of the monitored machine can 
_ change, for example, according to the invention, when the predetermined maximum number 
;0 of machines per subdomain has been reached, the indicator deployment agents are reactivated 
i J in order to re-evaluate the hardware and software configuration by performing a search for 
the instantiable elements, instantiating only those of the indicator agents that don't already 
t3 exist, and deleting those that no longer have any reason to exist. 

~/~ In a variant of embodiment, when the programming language is Java, the creation of a 

^ configuration agent, an indicator deployment agent or an indicator agent consists of sending 
□ an agent creation request "agentCreateRequest" to the "factory" agent of the agent machine 
26 that manages the subdomain in which the agent must be deployed. This request includes as 
H parameters the identification of the agent and the status of the agent. The status of the agent 
corresponds to the serialized Java object of this agent. When the "factory" agent receives this 
request, it deserializes the serialized Java object and thereby obtains the requested agent. 
Annex 1 provides exemplary implementations of various equations that make it 
25 possible to evaluate indicators. 

Fig. 2 represents an exemplary architecture of the links between monitoring agents 
according to the method of the invention. A link is understood to mean an agent's 
subscription to another agent's subscriber list. In this figure, the resources have been 
abstracted. In essence, the chosen installation of the indicator agents and the synthesis agents 
30 depends on the distribution policy, i.e., the criteria of choice for determining which resource 
will be used as the calculation site for this or that indicator. The monitoring of a resource can 
actually be performed, for example, in the resource itself by installing the indicator agent or 
agents in this machine. However, it is possible to perform the monitoring of this same 
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resource from another, remote resource in which the indicator agent or agents for this 
resource are installed. In this case, these remote indicator agents perform the measurements 
required for the evaluation of the indicators by querying SNMP-type agents installed in the 
resource to be monitored. Likewise, a resource can be the installation site for a plurality of 
5 synthesis nodes. In fact, the synthesis agents of these synthesis nodes can be managed by just 
one agent machine. Furthermore, this same agent machine can manage a plurality of indicator 
agents. 

According to Fig. 2, the monitored domain comprises two subdomains (dl, d2). Each 
subdomain (dl, d2) comprises a given number of resources (not represented) monitored by a 
10 plurality of agents (Al.l, A 1.2, A 1.3, Bl.l, B1.2, B1.3). In the exemplary embodiment 

represented in Fig. 2, each agent (Al.l, A 1.2, A 1.3, Bl.l, B 1.2, B 1.3) calculates the same 
^ first indicator (II), but in different resources. The indicator (II) related to the agent (Al.l) is, 
=0 for example, a function requiring the collection of a measurement (m A u) on the resource 
;/i associated with the respective agent (Al.l). For example, the indicator (II) is determined by 
¥5 the formula indicating the number of alarms sent per minute sent a resource. Likewise, each 
O indicator (II) related to the other agents (A1.2, A1.3, Bl.l, B1.2, B1.3) is a function requiring 
"~ the collection of the respective measurements (m A i.2, m A i.3., m B u, m B j.2, niBi.3) in each 
^ resource associated with each indicator agent (A1.2, A1.3, Bl.l, B1.2, B1.3). 
□ Each subdomain (dl, d2) comprises a first synthesis agent (A s .i, As.2) responsible for 

2Q evaluating a second indicator (12). The formula of this indicator (12) corresponds, for 
H example, to the sum of the first indicators (II) in the entire subdomain associated with the 

first synthesis agent (A s .i, A s .2)- Thus, these two agents make it possible to know the number 
of alarms per minute sent by all of the resources of each subdomain (dl, d2). A second, 
supplementary synthesis agent (A s .d) can also be installed and is designed to synthesize the 
25 information from all of the monitored domains. Consequently, this second synthesis agent 
(As.d) is responsible for evaluating a third indicator (13). The formula of this third indicator 
(13) corresponds to the sum of the second indicators (12). This third indicator (13) makes it 
possible to know the number of alarms per minute sent by all of the resources of the 
monitored domain. 

30 During the configuration of the monitored domain represented in Fig. 2, each 

indicator agent (Al.l, A 1.2, A 1.3, Bl.l, B 1.2, B 1.3) is first deployed in one or several 
resources, so as to be able to collect the respective measurements (niAi.i, ttiai.2, niAi.3., mBi.i, 
niBi.2, niBi .3) required for the evaluation of the first indicator (II) by each indicator agent 



(A 1.1, A 1.2, A 1.3, Bl.l, B1.2, B1.3). In a second step, the monitored domain is broken down 
into subdomains (dl, d2), and each agent ( A 1. 1, A 1.2, A 1.3, Bl.l, B 1.2, B 1.3) is assigned to 
a subdomain. In this step, the naming service (SN) updates the list (4) of associations, 
because the new indicator agents created are declared by the deployment agents using a 
5 procedure specific to the naming service by creating an association (indicatorName - 
domainName). 

Next, the first synthesis agents are installed in each synthesis node associated with 
each subdomain (dl, d2). During its configuration, each first synthesis agent will subscribe to 
each indicator agent revealed by the notification (IndicatorLookupResp) as a result of the 
10 procedure described above. In the same way, the second synthesis agent (As.d) will subscribe 
to the first synthesis agents (As.i A s .2). 

In operation, the respective indicator agents will evaluate, for example periodically, 
kQ the first indicator (II) in each associated resource and transmit to the first synthesis agent 
i"l (As.i As.2), if necessary, the new value of the first indicator (II). In the same way, the first 

synthesis agents (A s .i A s .2) will evaluate the second indicator (12) with the last values of the 
p first indicator (II) received and transmit to the second synthesis agent (A s .i A s .2), if 
sp necessary, the new value of the second indicator (12). This second synthesis agent (A s .d) also 
53 evaluates the third indicator (13) with the last values of the second indicator (12) received. 
ih According to the invention, the monitoring method comprises a step for reorganizing 

■W the links between the various agents when the number of resources per subdomain reaches 
!« the predetermined maximum number. 

Given the architecture of the links between indicator agents and synthesis agents, a 
modification of the monitored domain essentially involves the addition and/or deletion of at 
least one resource in at least one subdomain. In other words, a reorganization consists of 
25 modifying the subscriptions of the synthesis agents in the subscriber tables (1010, 1020, 

2010) as a function of the assignment to this or that subdomain of the additional resource or 
resources. 

The addition of a resource to a subdomain follows the following procedure. 
First of all, new indicator agents are installed and configured in the new resource if 
30 the agents have not been previously installed. In essence, when a resource is moved from one 
subdomain (dl) to another subdomain (d2), there is no new installation of indicator agents, 
since they already exist. 
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Next, the configuration agent sends, to the naming service (SN) of the agent machine 
responsible for the supervision of the subdomain (di) to which the resource must be added, an 
agent addition notification (addToDomain) specifying the name of the subdomain and the 
name of the agent added. In response to this notification, the naming service searches for and 
5 determines all of the agents performing evaluations in the subdomain using the indicator 

evaluated by the new indicator agent. The naming service then sends, to each agent found, an 
object addition notification (addltem) that specifies the name of the agent added. 

Each of the agents thus notified modifies its internal status and subscribes to the new 
indicator agent through a subscription notification (Subscribe), then recalculates its value as 
10 soon as possible. 

The deletion of a resource from a subdomain follows the following procedure. 
The naming service (SNi) of the subdomain (di) receives, from the entity that initiated 
*0 the deletion of the resource, a deletion notification (removeFromDomain) that specifies the 
!ij name of the domain in question as well as the name of the resource to be deleted. In response 
y$ to this notification, the naming service (SNi) searches for and retrieves all of the agents 
(3 performing evaluations in the subdomain using the indicator evaluated by the deleted 

indicator agent. The naming service (SNi) sends an object deletion notification (removeltem) 
that specifies the name of the deleted resource, thus allowing the agents to modify their 
O status, to unsubscribe from the indicators, whose references will disappear, and to re -evaluate 
M their value. 

^ As soon as this phase is finished, the indicator agent thus disconnected can, for 

example, be included in another domain by means of the procedure for adding a resource. 

^TTkliuuld b©-cleaLi2^hose skilled in the art that the present invention allows for 
embodiments in many other specific*T5rrns-wdlhout going beyond the scope of application of 

25 the invention as claimed. Consequently, the present enttmdiqjents should be considered as 
examples, but can be modified within the range defined by the scopeofThe-attaohed claims 
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ANNEX 1 


Exemplary implementation of an indicator named ifUtilizationBandWidth that does 
not refer to another indicator. The equation for this indicator is as follows: 

class IfUtilizationBandWidth_Indicator extends Primitivelndicator 
{ 

int delta_l=0; 

public IfUtilizationBandWidth__Indicator( short serverld, 

String machineName, 
int period, 
Oid[] attributes, 
Agentld[] indicators 

) 

throws UnknownHostException, IOException 

{ 

super( serverld, machineName, period, attributes, indicators ); 

} 

public AsnObject evaluate( long deltaT ) 
{ 

int ifInOctets_l= ((AsnUnsInteger)pairs[0]. value). value; 
int ifOutOctets_l= ((AsnUnsInteger)pairsfl]. value). value; 
int ifSpeed_l= ((AsnUnsInteger)pairs[2].value).value; 
int new_delta_l= ifInOctets_l + ifOutOctets_l; 
int old_delta_l=delta_l; 
delta_ 1 =ne w_delta_ 1 ; 

int newValue= ( ( ( ( 8 *(new_delta_l-old_delta_l) ) / (int) deltaT ) / 

ifSpeed_l )* 100); 

return( new Asnlnteger( new Value ) ); 

} 

} 
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public class IfUtilizationBandWidth_Deployer extends IndicatorDeployer 
{ 

private static String ind_patterns[]={ 
Mib2.ifInOctetst + ".1", 
Mib2.ifOutOctets + ".1", 
Mib2.ifSpeed + M" 

}; 

private static IndicatorReferencef] ind_indicators=null; 
private static int ind_period= 1 ; 


public IfUtilizationBandWidth_Deployer( short serverld, 

String machineName, 
String[] machines 

) 

15 throws Exception 

{ super(serverld, machineName, ind_patterns, machines); 
this. serverld = serverld; 
13 t hi s . machi neName=machineName ; 

^0 this.period=ind_period; 
2b indicatorNames=ind_indicators; 
]jj } 

Iji: protected Primitivelndicator createlnstance( short serverld, 

~ String machineName, 

25 int period, 

ig Oid[] attributes, 

S E Agentld[] indicators 

=0 throws UnknownHostException, IOException 

13) { return( new IfUtilizationBandWidth_Indicator( serverld, 

machineName, 
period, attributes, 
indicators)); 

} 

35 } 

This equation uses instantiated indicators iflnOctetsJ, ifOutOctetsJ and ifSpeed.l, 
the names of which are stored in a table used by the builder. 

The evaluation of this equation is performed by a generic method called "evaluate". 
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Exemplary implementation of the indicator named ifErrorsRatio when it refers to 
three other indicators, i.e. ifErrors.l, IflnPacketsl and If OutPackets 1 . The equation for this 
indicator is as follows 


class IfErrorsRatio_Indicator extends Primitivelndicator 
{ 

public IfErrorsRatio_Indicator( short serverld, 

String machineName, 
int period, 
Oid[] attributes, 
Agentld[] indicators 

) 

throws UnknownHostException, IOException 

{ 

super( serverld, machineName, period, attributes, indicators ); 

} 

public AsnObject evaluate( long deltaT ) 
{ 

int IfErrors=((AsnInteger) indicatorValues[0]). value; 
int Mnpackets=((AsnInteger) indicatorValues[l]).value; 
int IfOutpackets=((AsnInteger) indicatorValues[2]). value; 
int newValue= ( IfErrors / ( Iflnpackets + IfOutpackets ) ); 

return( new Asnlnteger( new Value ) ); 

} 

} 


public class IfErrorsRatio_Deployer extends IndicatorDeployer 
{ 

private static String ind_patterns[]=null; 

private static IndicatorReference[] ind_indicators= { 
new IndicatorReference( "IfErrors_Indicator M ) , 
new IndicatorReference( "Iflnpacketsjndicator") , 
new IndicatorReference( "IfOutpackets_Indicator") 

}; 

private static int ind^period = 100 ; 

public IfErrorsRatio_Deployer( short serverld, 

String machineName, 
String[] machines 

) 

throws Exception 

{ 
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super(serverld, machineName, ind_patterns, machines); 

this.serverld = serverld; 

this.machineName=machineName; 

period=ind_period; 

indicatorNames=ind_indicators; 

} 

protected Primitivelndicator createlnstance( short serverld, 

String machineName, 
int period, 
Oid[] attributes, 
Agentld[] indicators) 
throws UnknownHostException, IOException 

{ 

return( new IfErrorsRatio_Indicator( serverld, 
machineName, 
period, 
attributes, 
indicators)); 
} 

} 
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Exemplary implementation of a new iterator, for example SUM, corresponding to a 
sum. The equation for this iterator is as follows: 

public class Sum extends Compoundlndicator 
5 { 

public Sum( int period, String[] machines, Class klass ) 
{ 

super( period, machines, klass ); 
} 

10 

public AsnObject evaluate( long deltaT ) 
{ 

int length = indicatorValues. length; 
int current = 0; 

15 

for( int i = 0; i < length; i++ ) 

!□ { 

?y if( indicatorValues [ i ] != null ) 

N { 

!|§ current += ((Asnlnteger) indicatorValues [ i ]).value; 

l 1 : }; 

5 }; 

'= return( new Asnlnteger( current ) ); 

} 

6 ) 
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